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DETAILED ACTION 

1 . Claims 22-42 have been examined. 

Papers Submitted 

2. It is hereby acknowledged that the following papers have been received and placed of 
record in the file: RCE and Amendment as received on 3/1/2005. 

Drawings 

3. The drawings are objected to for the following reason: In the paragraph appearing on 
page line, beginning on line 19 and continuing to page 10, line 3, it is appears as if each entry 
within the history table has a corresponding replacement field. However, the amended Fig. 1, 
shows just a random unlabeled single box which is supposed to represent the replacement field. 
The drawing should be amended more appropriately such that each history field has a 
corresponding replacement field. A proposed drawing correction or corrected drawings are 
required in reply to the Office action to avoid abandonment of the application. The objection to 
the drawings will not be held in abeyance. 

Claim Objections 

4. Claim 22 is objected to because it is not clear if applicant, desires to change "is used" in 
line 3 to —was used— like in claim 31. The specification (last paragraph on page 9) appears to 
support the language "was used". Appropriate correction is required. 
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5. Claim 30 is objected to because of the following informalities: Please insert --a— before 
"history multiplexer" in the fourth to last line. Appropriate correction is required. 

6. Claim 37 is objected to because of the following informalities: Please insert —a— before 
"history multiplexer" in the fourth to last line. Appropriate correction is required. 

Maintained Rejections 

7. Applicant has failed to overcome the prior art rejections set forth in the previous Office 
Action for claims 27-30 and 32-39. Consequently, these rejections are respectfully maintained 
by the examiner and are copied below for applicant's convenience. 

Withdrawn Rejections 

8. Applicant has overcome the rejections set forth in the previous Office Action for claims 
22-26, 31, and 40-42. Consequently, these rejections are hereby withdrawn by the examiner. 
However, upon further consideration, a new ground of rejection is applied below. 

Claim Rejections - 35 USC §102 

9. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 35 1(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 



Application/Control Number: 09/535,765 Page 4 

Art Unit: 2183 

10. Claims 27-29 are rejected under 35 U.S.C. 102(e) as being anticipated by McFarling, 
U.S. Patent No. 6,374,349 (as applied in the previous Office Action). 

1 1 . Referring to claim 27, McFarling has taught a processor comprising: 

a) an instruction pointer (IP) generator capable of providing an address. See column 3, lines 12- 
16. Note that McFarling makes reference to a program counter, which performs the same 
function as an IP generator (i.e. they both provide an address at which an instruction will be 
fetched). 

b) saturating counter branch prediction (SCBP) logic having an input coupled to the IP generator 
and capable of providing a first taken/not-taken prediction at an output responsive to the address. 
See column 9, lines 10-16. It has been disclosed that a bimodal predictor always provides a 
prediction. Furthermore, from column 3, lines 1 1-28, the bimodal predictor is shown to be an 
array of saturated counters. Furthermore, column 3, lines 12-16, explain that the SCBP logic is 
coupled to the program counter (IP generator). 

c) local branch history prediction (LBHP) logic having an input coupled to the IP generator and 
capable of providing (1) a second taken/not-taken prediction at an output responsive to the 
address resulting in a hit, and (2) a hit/miss indication for the address, wherein the LBHP logic 
includes at least one local branch history table to provide a taken/not-taken history from a 
matching entry in said table in response to a hit. See column 9, lines 18-24, and Fig. 10, and note 
that a second prediction is provided (line 102 in Fig. 10). Also, in column 4, lines 15-19, 
McFarling has explained that the local history table is indexed based on a portion of the branch 
instruction address (tag). Therefore, the local history table is also coupled to the IP generator. It 
should be realized that the LBHP includes at least one local branch history table to provide a 
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taken/not-taken history in response to a hit. From Fig. 10, it can be seen that the tag/history data 
is stored in one table 100 and the prediction (CNT) is stored in a second table. The most 
significant bit of CNT is used to make a taken/not-taken prediction. Finally, it should be realized 
that the taken/not-taken history is provided from a matching entry (storage location) in the local 
table. This matching entry is the location that the branch instruction's address maps to. 

d) combinational logic that is predetermined at the time the processor is built to implement 
without memory a decision function whose output is the second prediction based on the 
taken/not-taken history. It should be realized that the second prediction is generated as the final 
prediction by the multiplexer shown in Fig. 10 when there is a hit in the local table. This 
multiplexer is combinational logic without a memory that performs the decision function: 

pred = (pred B AND hit') OR (pred H AND hit) 

More specifically, the final prediction (pred) generated by the multiplexer is either the prediction 
from the bimodal predictor (preds) if there is miss (hit') in the local history table or the 
prediction from the local history table (predn) if there is a hit in the local history table. And, 
combinational logic is just a logic circuit whose outputs depend only on the present logic inputs. 
Clearly, the multiplexer of Fig. 10, outputs a value which is dependent on its inputs (see the 
formula above). 

e) a multiplexer having an input coupled to the outputs of the SCBP and LBHP logic and a select 
input coupled to receive the hit/miss indication and in response provide (1) the second prediction 
if there is a hit and (2) the first prediction if there is a miss. See Fig. 10 and note that the 
multiplexer has inputs from both the SCBP (104) and LBHP (104) and that one of the inputs is 
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chosen based on the hit signal that is used as a select line for the multiplexer. More specifically, 
from column 9, lines 10-24, if a history-table miss occurs, the provisional bimodal counter 
prediction will be used. On the other hand, if a history-table hit occurs, the second prediction 
from the history table will be generated as the final prediction. 

12. Referring to claim 28, McFarling has taught a processor as described in claim 27. 
McFarling has further taught address hash logic coupled between the EP generator and the inputs 
of the SCBP and LBHP logic to provide a plurality of index values to at least one of the SCBP 
and LBHP logic. Again, as discussed in column 3, lines 12-16, and column 4, lines 15-19, the 
prediction logic is provided an index by performing a truncation hashing function. More 
specifically, it has been taught (in the above passages) that the prediction logic receives a certain 
amount of low order branch instruction address bits as an index instead of the entire address. 
Therefore, the subset of wires used to carry the index from the IP generator to the prediction 
logic is address hash logic that performs a truncation algorithm. This logic transforms an initial 
value (branch instruction address) into another value used for locating corresponding data in a 
structure (SCBP and LBHP). 

13. Referring to claim 29, McFarling has taught a processor as described in claim 27. 
McFarling has further taught that the SCBP logic includes a bimodal predictor. See column 9, 
lines 10-24. 

Claim Rejections - 35 USC §103 

14. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 
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(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

15. Claims 22-23 and 40-41 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
McFarling, as applied above, in view of Hunt, U.S. Patent No. 6,151,672 (as previously cited). 

16. Referring to claim 22, McFarling has taught a method comprising: 

a) providing a first taken/not-taken prediction responsive to an address using a saturating counter 
branch predictor. See column 9, lines 10-16. It has been disclosed that a bimodal predictor 
always provides a provisional prediction, the prediction being provisional because it is provided 
for use by the system but the system may not choose the prediction as the final prediction. 
Furthermore, from column 3, lines 1 1-28, the bimodal predictor is shown to be an array of 
saturated counters. 

b) providing (1) a second taken/not-taken prediction responsive to the address resulting in a hit in 

a local branch history table, and (2) a hit/miss indication for the address. See column 9, lines 18- 

( 

24, and Fig. 10, and note that a second prediction is provided (line 102 in Fig. 10). 

c) McFarling has not taught that the second prediction is generated by a decision function that 
accepts as an input a history value from a matching entry of the local branch history table and 
produces as an output the second prediction, the decision function being implemented by 
combinational logic that has no memory so as to reduce on-chip area. However, Hunt has taught 
such a concept. See Fig. 2 of Hunt and note combinational logic 216 which accepts as an input a 
history value 226. Based in part on the history value, logic 216 generates the second prediction 
224 by performing a decision function (XOR operation). Clearly, an XOR gate 216 has no 
memory, which would reduce on-chip area, and Hunt has taught that such a prediction scheme is 
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useful because two well-behaved branches may share an entry in a branch history table while not 
interfering with one another. See Hunt's abstract. McFarling, meanwhile, notes that branches 
may map to similar locations in a table. See McFarling, column 3, lines 50-58. As a result, in 
order to lower interference in collision situations, it would have been obvious to one of ordinary 
skill in the art at the time of the invention to modify McFarling to include combinational logic 
with no memory for generating a second prediction based on a decision function, as taught by 
Hunt. 

d) selecting for the address one of (1) the second prediction if the indication is a hit, and (2) the 
first prediction if the indication is a miss. Again, see column 9, lines 10-24, and Fig. 10. If a 
history-table miss occurs, the bimodal counter prediction will be used. On the other hand, if a 
history-table hit occurs, the second prediction from the history table will be generated as the final 
prediction. 

17. Referring to claim 23, McFarling in view of Hunt has taught a method as described in 
claim 22. McFarling has further taught hashing the address prior to indexing at least one of the 
saturating counter branch predictor and the local branch history table. See column 3, lines 12-16, 
and column 4, lines 15-19. Recall that hashing is simply the function of converting a value (in 
this case, the branch instruction address) into a value used for locating corresponding data in a 
structure (in this case, the saturating counter predictor and the local history predictor). In the 
system of McFarling, a portion of a fiill branch instruction address is used to access a prediction 
entry in a corresponding structure would be considered hashing. 

18. Referring to claim 40, McFarling has taught a method comprising: 
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a) providing a first taken/not-taken prediction responsive to an address using a saturating counter 
branch predictor. See column 9, lines 10-16. It has been disclosed that a bimodal predictor 
always provides a prediction. Furthermore, from column 3, lines 1 1-28, the bimodal predictor is 
shown to be an array of saturated counters. 

b) providing a second taken/not-taken prediction responsive to the address resulting in a hit in a 
local branch history table. See column 9, lines 18-24, and Fig. 10. Note in Fig. 10 that if a hit in 
the local table occurs, a second prediction is provided on line 102. 

c) McFarling has not taught that the second prediction is generated using a decision function that 
is provided with a taken/not-taken history value from a history field of a matching entry of the 
local branch history table and produces as output the second prediction, the decision function 
being implemented by combinational logic that has no counter or state machine. However, Hunt 
has taught such a concept. See Fig. 2 of Hunt and note combinational logic 216 which accepts as 
an input a history value 226. Based in part on the history value, logic 216 generates the second 
prediction 224 by performing a decision function (XOR operation). Clearly, an XOR gate 216 
has no memory, which would reduce on-chip area, and Hunt has taught that such a prediction 
scheme is useful because two well-behaved branches may share an entry in a branch history table 
while not interfering with one another. See Hunt's abstract. McFarling, meanwhile, notes that 
branches may map to similar locations in a table. See McFarling, column 3, lines 50-58. As a 
result, in order to lower interference in collision situations, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to modify McFarling to include 
combinational logic with no memory for generating a second prediction based on a decision 
function, as taught by Hunt. 
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d) selecting for the address one of the first prediction and the second prediction depending on 
whether or not the address resulted in said hit. See column 9, lines 10-24, and Fig. 10. If a 
history-table miss occurs, the counter prediction will be used. On the other hand, if a history- 
table hit occurs, the history table prediction will be used. 

19. Referring to claim 41, McFarling in view of Hunt has taught a method as described in 
claim 40. McFarling has further taught: 

20. a) updating a replacement field for an existing entry in the local branch history table that 
includes a history field containing a plurality of taken/not-taken outcomes of one or more 
previously executed branch instructions associated with said address, only if the first prediction 
is incorrect. It should be noted that if there is a miss in the local table, then the bimodal predictor 
(saturating counter predictor) will make the prediction. According to column 9, lines 45-50, if 
the bimodal prediction is already correct, then no replacement will occur to the local table. 
Therefore, replacements will only occur when the first prediction from the bimodal predictor is 
incorrect. When it is incorrect, an entry within the local table (a replacement field which 
includes a tag, history, and valid field as shown in Fig. 17) will be replaced with information 
corresponding to the mispredicted branch. In other words, each entry within the table is a 
replacement field. Whatever entry is replaced is the matching entry as the information for the 
new branch will only be placed into a single entry. This entry would be determined by some 
replacement algorithm such as the least recently used (LRU) replacement algorithm. This is 
mentioned in column 8, lines 6-8. 
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21 . Claims 24 and 42 are rejected under 35 U.S. C. 103(a) as being unpatentable over 
McFarling in view of Hunt, as applied above, and further in view of Henry et al., U.S. Patent No. 
6,550,004 (as applied in the previous Office Action and herein referred to as Henry). 

22. Referring to claim 24, McFarling in view of Hunt has taught a method as described in 
claim 22. McFarling has further taught: 

a) updating a replacement field for a matching entry in the local branch history table only if the 
first prediction is incorrect, indicating that the entry is used to make a prediction. It should be 
noted that if there is a miss in the local table, then the bimodal predictor (saturating counter 
predictor) will make the prediction. According to column 9, lines 45-50, if the bimodal 
prediction is already correct, then no replacement will occur to the local table. Therefore, 
replacements will only occur when the first prediction from the bimodal predictor is incorrect. 
When it is incorrect, an entry within the local table (a replacement field which includes a tag, 
history, and valid field as shown in Fig. 17) will be replaced with information corresponding to 
the mispredicted branch. Whatever entry is replaced is the matching entry as the information for 
the new branch will only be placed into a single entry. This entry would be determined by some 
replacement algorithm such as the least recently used (LRU) replacement algorithm. This is 
mentioned in column 8, lines 6-8. This update indicates that the matching entry will be used to 
make a prediction in the future because the next time that same branch is encountered, a hit will 
occur in the local history table and the local table will provide the prediction as opposed to the 
incorrect bimodal prediction. 

b) McFarling has not explicitly taught updating said history field, with the outcome of an 
executed branch instruction, only if the first prediction is incorrect and the second prediction is 
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correct. However, Henry has taught such a concept. See Fig.6 and note that if the final (first) 
prediction is incorrect (step 624) and the non-selected (second) prediction is correct (step 628), 
then the corresponding history field within predictor 212 (Fig.2) will be updated (step 632). It 
should be noted from Fig.3 that component 212 does have history entries. Henry has further 
disclosed that such an update scheme advantageously yields improved branch prediction results. 
As a result, it would have been obvious to one of ordinary skill in the art at the time of the 
invention to modify McFarling to employ the update policy taught by Henry. 

23. Referring to claim 42, McFarling in view of Hunt has taught a method as described in 
claim 41. McFarling has not explicitly taught updating said history field based on an outcome of 
an executed branch instruction that is pointed to by said address, only if the first prediction is 
incorrect and the second prediction is correct. However, Henry has taught such a concept. See 
Fig.6 and note that if the final (first) prediction is incorrect (step 624) and the non-selected 
(second) prediction is correct (step 628), then the corresponding history field within predictor 
212 (Fig.2) will be updated (step 632). It should be noted from Fig.3 that component 212 does 
have history entries. Henry has further disclosed that such an update scheme advantageously 
yields improved branch prediction results. As a result, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to modify McFarling to employ the update 
policy taught by Henry. 

24. Claims 25-26 are rejected under 35 U.S.C. 103(a) as being unpatentable over McFarling 
in view of Hunt, as applied above, and further in view of Hennessy and Patterson, Computer 
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Architecture - A Quantitative Approach, 2 nd Edition (as applied in the previous Office Action 
and herein referred to as Hennessy). 

25. Referring to claim 25, McFarling in view of Hunt has taught a method as described in 
claim 22. 

a) Furthermore, it is inherent that an instruction for which a prediction is made would have been 
fetched at the corresponding address. This concept is also supported in column 3, lines 12-16. 
The bits that are used to index the predictors are taken from an address that is stored in the 
program counter (PC). The PC is a register that is used to store a pointer to a location in memory 
from which an instruction is fetched. 

b) In addition, McFarling has taught that if the instruction is a branch, a determination as to 
whether the branch is taken or not-taken will not be available until the instruction has progressed 
beyond a decode stage. See column 1, lines 33-36, and note that the outcome of the branch is 
unknown until after the branch is executed. Since execution inherently follows decoding, the 
branch determination is not known until the instruction has progressed beyond the decode stage. 

c) Finally, although it is inherent that McFarling' s system decodes fetched instructions (in order 
to determine what operation would be performed), McFarling has not explicitly taught that at 
least one of the first and second predictions is available when the at least one instruction is being 
decoded. However, Hennessy has shown the state of a pipeline that implements branch 
prediction. See Figure 3.26 on page 167. Note that during the decode stage (ED) of the branch 
instruction, the predicted instruction (Instruction i+1) is being fetched in the fetch stage (IF). In 
order for the predicted instruction to be fetched during the decode stage of the branch instruction, 
the prediction must be available when the branch is being decoded. It can be seen that the 
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pipeline is kept full and free of stalls through the use of branch prediction. Furthermore, from 
column 1, lines 49-56, McFarling has explained that the purpose of a branch prediction scheme 
is to keep the pipeline full and free from stalls, which in fact is supported by the teachings of 
Hennessy. Therefore, it would have been obvious to one of ordinary skill in the art at the time of 
the invention to make the at least one of the predictions available when the at least one 
instruction is being decoded. This will ensure that the pipeline will stay full, as is desired by 
McFarling. It should be further noted that the scheme taught by Hennessy does not only apply to 
a system in which branch outcomes are known at the end of the decode stage. This can be 
realized since McFarling has taught a taken/not-taken prediction scheme in which branch 
outcomes are unknown until after the branch executes (column 1, lines 33-45). The general idea 
of such a prediction scheme is to keep the pipeline full regardless of when the branch outcome is 
"officially" known, and if the prediction is available while the branch is being decoded, then the 
next instruction can be fetched without stalling. 

26. Referring to claim 26, McFarling in view of Hunt and further in view of Hennessy has 
taught a method as described in claim 25. If the at least one instruction is a branch, then: 

a) it is inherent that the method further comprises determining a target address of the branch. If a 
target address were not determined for a conditional branch, then the system would not know 
where to fetch the next instruction from when the branch is predicted taken. 

b) it is inherent that the method further comprises loading an instruction pointer generator with 
the target address if at least one of the first and second predictions indicates that the branch is to 
be taken. In order to fetch the target instruction of a predicted-taken branch, the target address 
must be loaded into the IP generator (also known as the program counter (PC)). The IP 
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generator (PC) is a register that is used to store a pointer to a location in memory from which an 
instruction is fetched. For the applicant's benefit, please see Hennessy, page 162 and Figure 
3.22 on page 163. From the figure it can be seen that the PC (IP generator) is loaded with the 
output of a multiplexer. Inputs to the multiplexer include the next instruction address and the 
target address (which is outputted by the ADD logic in the ID stage of the pipeline). If the target 
address were not stored in the IP generator (PC), then the target instruction could not be fetched 
from instruction memory (as shown in Figure 3.22 of Hennessy). 

27. Claim 30 is rejected under 35 U.S.C. 103(a) as being unpatentable over McFarling, as 
applied above, in view of Gochman et al., U.S. Patent No. 5,842,008 (as applied in the previous 
office action and herein referred to as Gochman). 

28. Referring to claim 30, McFarling has taught a processor as described in claim 27. 

a) 1) McFarling has taught a single branch history table that provides a tag and a taken/not- 
taken history associated with the tag in response to a hit. See Fig. 1 0. McFarling has not 
explicitly taught the concept of using multiple branch history tables to provide a tag and a 
taken/not taken history associated with the tag in response to a hit. However, Gochman has 
taught the concept of using multiple memories that are simultaneously accessed based on an 
index. After each memory produces its respective data, a single value is selected and 
propagated. See Fig. 4a and Fig. 5. A person of ordinary skill in the art would have recognized 
that by using multiple memory tables, as opposed to a single table, the number of index bits used 
to access an entry in each bank would be reduced, resulting in a reduction in the amount of 
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hardware. Therefore, it would have been obvious to one of ordinary skill in the art at the time of 
the invention to use multiple branch history tables instead of a single table. 

2) In addition, it would have been obvious to one of ordinary skill in the art at the time of 
the invention to separate the single history table as taught by McFarling into multiple history 
tables since it has been held that making separable where needed is obvious. See Nerwin v. 
Erlichman , 168 USPQ 177 (1969). It should be realized that the single-table system serves the 
same purpose as the multiple-table system in that an index is applied and a corresponding entry 
is selected. Although multiple entries are selected in response to applying an index to multiple 
tables (in Applicant's system), the final result is still only a single entry, as taught by McFarling. 

b) it is inherent that McFarling' s system includes compare logic that is connected to the history 
table in order to determine the hit/miss indication. A hit indication is given when the branch 
instruction in question has a corresponding entry in the history table. Therefore, in order to give 
a hit signal, some logic must be used to compare the entries in the table with the current branch 
instruction. Since McFarling has not taught multiple history tables, it follows that he has not 
taught multiple compare logic units that are connected to those tables. However, if multiple 
tables were used, as discussed in part (a) above, then it would be inherent that multiple compare 
logic units would be connected to those tables in order to determine if a branch has a 
corresponding entry. 

c) McFarling has not taught a history multiplexer coupled to each of the plurality of tables to 
provide the history for the hit. However, since McFarling has only taught one table, there is no 
need to multiplex multiple values from multiple tables. On the other hand, if multiple tables 
were implemented in McFarling' s system, as described in part (a) above, then it would follow 
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that a multiplex system would be needed in order to choose one of the plurality of history values. 
This concept has been taught by Gochman in Fig.4a (component 320) and Fig.5. When each of 
the values are retrieved from the memories, the selection logic determines which value will be 
sent through. Therefore, if multiple tables were used in McFarling's system, in order to ensure 
only the correct history value is used to make the prediction, it would have been obvious to one 
of ordinary skill in the art at the time of the invention to use a multiplexer to perform the 
selection. 

d) McFarling has taught combinational logic to provide the second taken/not-taken prediction. 
See Fig. 10, and note the multiplexer which selects between either the local prediction or the 
bimodal prediction. However, since McFarling has not taught multiple history tables and a 
history multiplexer, it follows that he has not taught combinational logic that is connected to a 
history multiplexer to provide the second taken/not-taken prediction. It should be realized 
though that the combinational logic of McFarling takes in the second prediction as an input. If 
multiple tables and a history multiplexer were used, as discussed in parts (a) and (c) above, then 
it would follow that the combinational logic would be coupled to an output of the history 
multiplexer because the history multiplexer will be providing the second taken/not-taken 
prediction as an input to the combinational logic. Therefore, if multiple tables and a history 
multiplexer were used, it would have been obvious to one of ordinary skill in the art at the time 
of the invention to connect the combinational logic taught by McFarling to the output of the 
history multiplexer since the combinational logic uses the history information in order to 
generate a taken/not-taken prediction. 
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29. Claims 3 1 and 35-36 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
McFarling, as applied above, in view of Henry, as applied above. 

30. Referring to claims 31, McFarling has taught a method as described in claim 27. 
McFarling has further taught: 

a) logic to update a replacement field for said matching entry only if the first prediction is 
incorrect, indicating that the entry is used to make a prediction. It should be noted that if there is 
a miss in the local table, then the bimodal predictor (saturating counter predictor) will make the 
prediction. According to column 9, lines 45-50, if the bimodal prediction is already correct, then 
no replacement will occur to the local table. Therefore, replacements will only occur when the 
first prediction from the bimodal predictor is incorrect. When it is incorrect, an entry within the 
local table (a replacement field which includes a tag, history, and valid field as shown in Fig. 17) 
will be replaced with information corresponding to the mispredicted branch. Whatever entry is 
replaced is the matching entry as the information for the new branch will only be placed into a 
single entry. This entry would be determined by some replacement algorithm such as the least 
recently used (LRU) replacement algorithm. This is mentioned in column 8, lines 6-8. This 
update indicates that the matching entry was used to make a prediction for the previously 
associated branch because the entry is being replaced so it can no longer predict for its branch. 
For instance, assume there is prediction information for branch X in entry B of the table. 
Ultimately, if entry B is replaced with prediction information for branch Y, then this replacement 
indicates that the prediction information for branch X was used to make a prediction because it 
was used in the past. But, the information will no longer be used as it is being replaced. 
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b) McFarling has not explicitly taught logic to update a history field for said matching entry with 
the outcome of an executed branch instruction only if the first prediction is incorrect and the 
second prediction is correct. However, Henry has taught such a concept. See Fig. 6 and note that 
if the final (first) prediction is incorrect (step 624) and the non-selected (second) prediction is 
correct (step 628), then the corresponding history field within predictor 212 (Fig.2) will be 
updated (step 632). It should be noted from Fig.3 that component 212 does have history entries. 
Henry has further disclosed that such an update scheme advantageously yields improved branch 
prediction results. As a result, it would have been obvious to one of ordinary skill in the art at 
the time of the invention to modify McFarling to employ the update policy taught by Henry. 
3 1 . Referring to claim 35, McFarling has taught an apparatus comprising: 

a) means for providing an address of at least one instruction. See column 3, lines 12-16. Note 
that McFarling makes reference to a program counter, which performs the same function as an IP 
generator (i.e. they both provide an address at which an instruction will be fetched). 

b) means for providing a first taken/not-taken branch prediction based upon the current state of a 
state machine and responsive to the address. See column 9, lines 10-16. It has been disclosed 
that a bimodal predictor always provides a provisional prediction, because it is provided for use 
by the system but the system may not choose the prediction as the final prediction. Furthermore, 
from column 3, lines 1 1-28, the bimodal predictor is shown to be an array of saturated counters. 
Saturated counters are state machines in that they function differently when in different states. 
See Fig.2. 

c) local branch history prediction (LBHP) logic having an input coupled to the address providing 
means and capable of providing (1) a second taken/not-taken prediction at an output responsive 
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to the address resulting in a hit, and (2) a hit/miss indication for the address, wherein the LBHP 
logic includes at least one local branch history table to provide a taken/not-taken history in 
response to a hit. See column 9, lines 18-24, and Fig. 10. Also, in column 4, lines 15-19, 
McFarling has explained that the local history table is indexed based on a portion of the branch 
instruction address. Therefore, the local history table is also coupled to the IP generator. It 
should be realized that the LBHP includes at least one local branch history table to provide a 
taken/not-taken history in response to a hit. From Fig. 10, it can be seen that the tag/history data 
is stored in one table 100 and the prediction (CNT) is stored in a second table. When a hit in the 
local table occurs, the most significant bit of accessed CNT is used to make a second taken/not- 
taken prediction. 

d) means for generating without memory the second prediction based on the taken/not-taken 
history. It should be realized that the second prediction is generated as the final prediction by the 
multiplexer shown in Fig. 10 when there is a hit in the local table. This multiplexer is 
combinational logic without a memory that performs the decision function: 

pred = (pred B AND hit') OR (pred H AND hit) 

More specifically, the final prediction (pred) generated by the multiplexer is either the 
provisional prediction from the bimodal predictor (prede) if there is miss (hit') in the local 
history table or the second prediction from the local history table (pred^ if there is a hit in the 
local history table. And, combinational logic is just a logic circuit whose outputs depend only on 
the present logic inputs. Clearly, the multiplexer of Fig. 10, outputs a value which is dependent 
on its inputs (see the formula above). 
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e) a multiplexer having an input coupled to the outputs of the first prediction means and the 

LB HP logic and a select input coupled to receive the hit/miss indication and in response provide 
(1) the second prediction if there is a hit and (2) the first prediction if there is a miss. See Fig. 10 
and note that the multiplexer has inputs from both the SCBP (104) and LBHP (104) and that one 
of the inputs is chosen based on the hit signal that is used as a select line for the multiplexer. See 
column 9, lines 10-24 for further explanation. 

f) McFarling has not explicitly taught means for updating said history field, with the outcome of 
an executed branch instruction that is pointed to by said address, only if the first prediction is 
incorrect and the second prediction is correct. However, Henry has taught such a concept. See 
Fig. 6 and note that if the final (first) prediction is incorrect (step 624) and the non-selected 
(second) prediction is correct (step 628), then the corresponding history field within predictor 
212 (Fig.2) will be updated (step 632). It should be noted from Fig.3 that component 212 does 
have history entries. Henry has further disclosed that such an update scheme advantageously 
yields improved branch prediction results. As a result, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to modify McFarling to employ the update 
policy taught by Henry. 

32. Referring to claim 36, McFarling in view of Henry has taught an apparatus as described 
in claim 35. Furthermore, claim 36 is rejected for the same reasons set forth in the rejection of 
claim 23. Note that hashing and encoding perform the same operation in that a starting value 
(branch instruction address) is transformed into another value (index to prediction logic). 
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33. Claims 32-33 are rejected under 35 U.S.C. 103(a) as being unpatentable over McFarling, 
as applied above, in view of Hennessy, as applied above. 

34. Referring to claim 32, McFarling has taught a processor as described in claim 27. 
Furthermore, the processor of claim 32 performs the method described in claim 25. Therefore, 
claim 32 is rejected for the same reasons set forth in the rejection of claim 25. 

35. Referring to claim 33, McFarling has taught a processor as described in claim 32. 
Furthermore, the processor of claim 33 performs the method described in claim 26. In addition, 
it should be realized from the rejection of claims 25 and 26 that the decode stage taught by 
Hennessy has the ability to determine a target address. See Figure 3.22 on page 163. Therefore, 
claim 33 is rejected for the same reasons set forth in the rejection of claim 26. 

36. Claim 34 is rejected under 35 U.S.C. 103(a) as being unpatentable over McFarling, as 
applied above, in view of Rahman et al., U.S. Patent No. 5,805,878 (as applied in the previous 
Office Action and herein referred to as Rahman). 

37. Referring to claim 34, McFarling has taught a processor as described in claim 27. 
McFarling has not explicitly taught that the address points to a cache line having a plurality of 
instructions. However, Rahman has taught the implementation of a cache with multiple 
instructions per cache line. See column 9, lines 17-21. Also, Rahman has suggested in column 
9, lines 23-37, that a benefit of such a feature would be to allow for simultaneous fetching of 
multiple instructions. This would result in less time spent making memory accesses and 
increased instruction-level parallelism, which would result in higher throughput. Therefore, in 



Application/Control Number: 09/535,765 Page 23 

Art Unit: 2183 

order to increase throughput, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to implement a cache that contains a plurality of instructions per cache line. 

38. Claim 37 is rejected under 35 U.S.C. 103(a) as being unpatentable over McFarling in 
view of Henry, as applied above, in view of Gochman, as applied above. 

39. Referring to claim 37, McFarling has taught an apparatus as described in claim 35. 
Furthermore, claim 37 is rejected for the same reasons set forth in the rejection of claim 30. 

40. Claims 38-39 are rejected under 35 U.S.C. 103(a) as being unpatentable over McFarling 
in view of Henry, as applied above, in view of Hennessy, as applied above. 

41. Referring to claim 38, McFarling has taught an apparatus as described in claim 35. 
Furthermore, the apparatus of claim 38 performs the method of claim 25. Therefore, claim 38 is 
rejected for the same reasons set forth in the rejection of claim 25. 

42. Referring to claim 39, McFarling has taught an apparatus as described in claim 38. 
Furthermore, the apparatus of claim 39 performs the method of claim 26. Therefore, claim 39 is 
rejected for the same reasons set forth in the rejection of claim 26. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to David J. Huisman whose telephone number is (571) 272-4168. 
The examiner can normally be reached on Monday-Friday (8:00-4:30). 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-4162. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 

DJH 

David J. Huisman ^ , 

April 28, 2005 /f / ~ //^ 




